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Background of Invention 

[0001] The technical field for the claimed invention is an applied method of automated 
data manipulation by use of an electronic spreadsheet system or the like. 

[0002] Spreadsheet software, since its introduction with the advent of personal 
computers, is a popular computerized method of transforming raw data into 
meaningful information. With the use of a spreadsheet, which is a two dimensional 
rectangular grid made up of a finite number of rows and columns and where the 
intersection of each row and column is referred to as a cell, the user may enter, store 
and manipulate data. 

[0003] 

Although there has been an astonishing improvement in computer hardware (i.e. 
memory and speed) and software in the past decade, spreadsheet software capability 
appears to have reached a bottleneck due to the time-consuming iterative step-by- 
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step manual spreadsheet data operations. The user is often required to locate the 
target cells (source and/or destination cells) and perform spreadsheet data operations 
repetitiously in order to transform the raw data. 

[0004] Presently, there is no known technique in the electronic spreadsheet environment 
to improve this bottleneck. The challenge has been to overcome the difficulties in 
developing a generalized automated data manipulation tool. These difficulties are 
mainly due to a lack of method for (a) handling spreadsheet databases that have 
different format or structure, (b) identifying groups or sets of information or data that 
require the same type of spreadsheet data operation (i.e. copy, move, insert, etc.), and 
(c) allowing the user to instruct the computer to carry out all the desired operations 
throughout the entire database automatically. 

Summary of Invention 

[0005] The present invention (INV) provides a base to perform an automatic method of 
data manipulation in an electronic spreadsheet system or the like. 

[0006] This INV successfully overcomes the lack of method for (a) handling spreadsheet 
databases that have different format or structure, (b) identifying groups or sets of 
information or data that require the same type of spreadsheet data operation (i.e. 
copy, move, insert, etc.), and (c) allowing the user to instruct the computer to carry 
out all the desired operations throughout the entire database automatically, by 
combining existing spreadsheet capabilities with the new method and concepts that 
are derived from this INV. 

[0007] With this INV, spreadsheet capability can be extended to allow a simple user 

instruction method to: (1) perform the same kind of spreadsheet data operation(s), 
such as copy, paste, insert, delete, move, etc., automatically and (2) save a sequence 
of data operations in a 'batch job' and perform the same kind of data operations for 
any data file by re-executing the 'batch job'. 

[0008] Described below are the new concepts and logic that define the method used in 
this INV: 
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[0009] (1 ) Key Identifier (Key-ID): The Key-ID is a cell content identifier. It is simply a user 
defined symbol, number, alphanumeric character or text string (letter, word, phrase, 
etc.). The Key-ID is the entire content or part of the content of a cell or a group of 
cells that the user either identifies in the existing database or inserts into the 
database. The purpose of the Key-ID is to allow a computer program to overcome the 
difficulties in dealing with databases of different format or structure by separating the 
database into manageable data sets, in which operations can be performed 
systematically. 

[001 0] (2) Region Identifier (Region-ID): The Region-ID is a user specified region that 

contains the Key-ID in the database. The specified region can be defined using one or 
more parameters depending on where the region is located. If the region is located in 
the active spreadsheet, only one parameter is needed. In this case a Column or Row 
Identifier (RC-ID) can be used as the Region-ID. However, if the region extends to 
sheet(s) outside of the active sheet, the Region-ID requires additional parameters to 
specify the sheet name, file name, and/or directory name. 

[001 1] (3) Cell Collection Expression (CCE): The CCE uses two different identifiers- Key- 
ID and Region-ID in a format of Cell(Key-ID,Region-ID). The CCE is used to represent 
or identify all the cells, which contain the Key-ID in the region specified by the 
Region-ID. Each cell represented or identified by the CCE is called a CCE element. In 
this INV, each CCE element is a single cell containing the Key-ID within each data set. 

[001 2] (4) Logic for Systematic Automated Data Manipulation: The logic for systematic 
automated data manipulation is described in the following steps: 

[001 3] (a) Use the Key-ID to separate the database into several manageable data sets, 
which have the same data pattern. 

[0014] (b) Use the Region-ID to specify the region, which contains the Key-ID in the 
spreadsheet. 

[001 5] (c) Use a Cell Collection Expression (CCE) to represent or identify all the cells 

containing the Key-ID (each individual cell is a CCE element) in the specified region 
(Region-ID). 
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[0016] 



(d) Assign a standard coordinate system with origin (0,0) to the CCE element 
within each data set. 



(e) Given that every data set has the same data pattern, the relative locations of 
the target cells (source and/or destination cells) with respect to the CCE element 
origin will be the same for each data set. 

(f) With the user specified CCE and the fixed relative locations of the target cells 
(with respect to the origin), a spreadsheet data operation can easily be performed on 
one data set after another in a systematic manner throughout the entire database 
using existing spreadsheet features and macro capability. 

(g) The CCE, relative target cells, and type of spreadsheet data operation form a 
standard input format, which is independent of the number of data sets. This 
standard input format allows a series of user-specified data operations to be recorded 
and saved in a "batch job" for future re-execution on any data file. 

Brief Description of Drawings 

[0020] FIG 1 .0 shows an illustration of a personal computer system having a mouse, 
keyboard, full display screen, and CPU running an example electronic spreadsheet 
program. 

[0021] FIG 2.0 shows typical electronic spreadsheet windows comprising of a worksheet 
that has a number of rows and columns, which intersect thus creating the cells within 
the spreadsheet. 

[0022] FIG 3.0 shows a sample spreadsheet with a Key-ID in the original data file, that 
already has three separate data sets. 

[0023] FIG 3.1 shows data in a spreadsheet without an obvious Key-ID. 

[0024] FIG 3.2 shows the resulting data spreadsheet with a Key-ID inserted to separate 
the original database into three data sets. 

[0025] FIG 3.3 shows the flow chart of a macro program, which allows the user to insert a 
Key-ID on the basis of a cell content change. 



[0017] 



[0018] 



5 [001 9] 
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[0026] FIG 4.0 shows the difference between the Cell Collection Expression (CCE) and 
conventional spreadsheet reference style for cell identification. 

[0027] FIG 5.0 shows the use of the CCE to represent or identify the cells containing the 
Key-ID in each data set. 

[0028] FIG 5.1 shows the assignment of a standard coordinate system to Data Set 1 (as 
shown in FIG. 5.0) with the CCE element as origin (0,0). 

[0029] FIG 5.2 shows a subsequent assignment of a standard coordinate system to Data 
Set 2 (as shown in FIG. 5.0) with the CCE element as origin (0,0). 

[0030] FIG 6.0 shows an example spreadsheet in which a "Copy Paste" operation will be 
performed. 

[0031] FIG 6.1 shows the results of the "Copy Paste" operation performed on the data 
spreadsheet in Figure 6.0. 

[0032] FIG 6.2 shows a "Copy Paste" Operation Flow Chart for a Search Region in a User- 
specified Column. 

Detailed Description 

[0033] For a detailed understanding of the invention, reference is made to Figures 1 .0 
through 6.2. 

[0034] The Personal Computer System and Spreadsheet Application Program 

[0035] Figure 1 .0 shows an illustration of a typical computer system with a mouse (1), 
keyboard (2), display screen (3), and CPU (4) running an example electronic 
spreadsheet window (5). The mouse (1) and keyboard (2) allow the user to interface 
and input instructions for data manipulation in the example electronic spreadsheet 
and display the results on the display screen (3). 

[0036] The Conventional Spreadsheet Reference Style 

[0037] identjf j cat j 0n 0 f spreadsheet cells is necessary when applying spreadsheet 

data operations, since it is the cells in a spreadsheet that are typically used to store 
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input data. The conventional reference style for a spreadsheet cell is of the format Cell 
(Row No.,Column No.) or Cell(Column Letter Row No.). Figure 2.0 shows examples of 
electronic spreadsheets, where the intersection of rows and columns make up the 
cells in the spreadsheet. For example, the intersection of Row 2 and Column 1 forms 
Cell(2,1), while Column A and Row 2 make up Cell(A2). 

[0038] in addition, the conventional reference style can be used to select a range of cells 
with a general format of Range(Upper Left CelhLower Right Cell). Figure 2.0 shows an 
example range of cells highlighted with a dotted rectangle Range[(5,2):(7,3)] or Range 
(B5:C7). Although the conventional style allows a range of cells to be selected, the 
range cannot selectively group the cells based on the cells' content. 

[0039] The Key Identifier (Key-ID) 

[0040] The Key-ID is a cell content identifier. It is simply a user defined symbol, number, 
alphanumeric character or text string (letter, word, phrase, etc.). The user can specify 
the Key-ID to be the entire content or part of the content in a cell or group of cells. 
The Key-ID is either drawn from the original database or inserted by the user through 
manual spreadsheet data operations or through simple programming. 

[0041] The Key-ID is used to separate the data in a spreadsheet into data sets that have 
the same kind of data construct. The Key-ID will allow a computer program to 
overcome the difficulties in dealing with databases of different format or structure by 
separating the spreadsheet data into manageable data sets, in which operations can 
be performed automatically. 

[0042] Region Identifier (Region-ID) 

[0043] The Region-ID is a user specified region (containing the Key-ID), which can be 

within one or more spreadsheets, and can be defined by up to four location identifiers 
namely: (a) Row or Column Identifier (RC-ID), (b) Worksheet Identifier (Sheet-ID), (c) 
Spreadsheet File or Workbook Identifier (File-ID) and (d) Directory Identifier 
(Directory-ID). In specifying the RC-ID, the user must either specify columns with 
letters (A, B, C, E, etc.) or rows with numbers (1 , 2, 3, 4, etc.). Moreover, when the RC- 
ID is a wildcard ('*'), the specified region becomes the entire sheet. 
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[0044] The Region-ID is represented as: 

[0045] Region-ID = RC-ID, [Sheet-ID], [File-ID], [Directory-ID] 

[0046] where the RC-ID is a required parameter and the Sheet-ID, File-ID, and Directory- 
ID are optional and not limited to only one specification. This is illustrated using the 
following explanations: 

[0047] 1) If the user specified region is located in the active sheet, the Region-ID equals 
RC-ID. 

[0048] 2) If the user specified region extends to other worksheets besides the active 
sheet within the active file, all the designated worksheet name or names must be 
specified in the Sheet-ID. In this case: Region-ID = RC-ID, Sheet-ID. 

[0049] 3) If the designated worksheet(s) is/are not located in the active spreadsheet file, 
all the file and directory names need to be specified in the File-ID and Directory-ID, 
respectively, in this case: Region-ID - RC-ID, Sheet-ID, File-ID, Directory-ID. 

[0050] It is the incorporation of the Region-ID into the Cell Collection Expression, to be 
illustrated later by example, which enhances cell representation or identification. 

[0051] Key-ID Selection or Insertion 

[0052] Figures 3.0 through 3.2 illustrate the use of the Key-ID and its effectiveness by a 
few examples. For the purposes of illustration and ease of visualizing the automated 
data manipulation method, each example contains only a limited number of data sets, 
each of which contains a fairly simple pattern. Although in these examples, manual 
processing of the data is feasible with prior art operations. In actual applications of 
this method, data is stored in many files and typically contains a very large number of 
data sets, each of which is composed of a complex pattern, rendering manual 
processing impractical. 

[0053] 

Figure 3.0 shows an example data spreadsheet containing production data for 
three producing oil wells. In this example, the format of the data naturally supports 
the use of the text 'Well Name 1 as the Key-ID (6), since the text 'Well Name 1 already 
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identifies three different data sets with a similar data pattern. 

[0054] On the other hand, Figure 3.1 shows a data spreadsheet without an obvious Key- 
ID. Therefore a Key-ID has to be inserted into the spreadsheet to separate the 
database. In this example, the user inserts a new row containing the text 'Well Name' 
whenever there is a change of cell content in Column A, beginning from Row 2. Figure 
3.2 shows the results of the Key-ID insertion, in which 'Well Name' becomes the 
inserted Key-ID (7). This figure clearly illustrates how the insertion of the Key-ID has 
partitioned the database into three data sets. 

[0055] The insertion of the Key-ID can generally be automated through simple prior art 
macro execution. Figure 3.3 shows a flow chart of a sample macro program, which 
allows the user to insert the Key-ID based on the content change of a cell for a 
specified column. 

[0056] Cell Collection Expression (CCE) 

[0057] The CCE is a new concept and can be viewed as a generalized method of 

spreadsheet cell representation or identification, which incorporates the Key-ID and 
Region-ID. Cell identification with the conventional reference style is limited by the 
fact that a cell can only be identified using a Row No. and Column No. (e.g. Cell (2,1), 
Celi(l 2,5), etc.), or Column Letter and Row No. (e.g. Cell(A2), Cell(El 2), etc.), and a 
range of cells can only be identified by the use of a range in a format Range(Upper 
Left Cell:Lower Right Cell). The CCE removes this limitation by use of a single 
expression to represent or identify a collection of cells, which contain the Key-ID in 
the cells' content. 

[0058] A CCE of format Cell(Key-ID,Region-ID) is defined by the user, with one identifier 
as the Key-ID and the other as the Region-ID. The Region-ID specifies the region 
(containing the Key-ID), which is defined by the RC-ID and optional identifiers such as 
the Sheet-ID, File-ID, and Directory-ID. The CCE represents or identifies all those cells 
containing the Key-ID within the user specified Region-ID. Additionally, each cell 
represented or identified by the CCE is called a CCE element. 

[0059] Comparison - CCE vs. Conventional Spreadsheet Reference Style for Cell 
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Identification 

[0060] Figure 4.0 shows a few examples illustrating the differences between the CCE and 
the conventional spreadsheet reference style for cell identification. (8) shows the 
general spreadsheet display and cell contents. The conventional spreadsheet 
reference style identifies a cell or group of cells based on the intersection of the Row 
No.(s) (9) and Column No.(s) (10), or Row No.(s) (11) and Column Letter(s) (12). 

[0061] The comparison of cell identification between the CCE and conventional cell 
reference style is explained in (1 3) using the example cell diagrams in (8). 

[0062] In Example 1 , the CCE Cell('Name\B) refers to all cells in Column B, which contain 
the text 'Name'. This allows the CCE CellCName'^) to represent a collection of two 
different cells, both containing the text 'Name 1 , in a region defined by the RC-ID as 
Column B. While using the conventional reference style, each cell would have to be 
individually identified using notation such as Cell(l ,2)/Cell(Bl) and Cell(3,2)/Cell(B3). 
Note that the typical spreadsheet 'Range 1 notation cannot be used to represent this 
collection of cells. 

[0063] In Example 2, the CCE CellOName',3) refers to all the cells in Row 3, which contain 
the text 'Name 1 . In this case the CCE represents a collection of two different cells, 
containing the text 'Name 1 , in a region defined by the RC-ID as Row 3. The 
conventional reference style can identify these cells as Cell(3,l)/Cell(A3), Cell(3,2)/Cell 
(B3), or Range(A3:B3). Although Range(A3:B3) is a single expression identifying these 
cells, the CCE has greater flexibility in that the cells are identified using their content- 
'Name'. 

[0064] In Example 3, the CCE CellCName',*) refers to all the cells in the entire sheet, 

which have content 'Name 1 . In this example the Region-ID is defined by a RC-ID equal 
to '*', which is a wild card representing a region encompassing the entire sheet. The 
CCE CellCName 1 ,*) represents four conventional cell references: Cell(3,l)/Cell(A3), Cell 
(5,l)/Cell(A5), Cell(l ,2)/Cell(Bl), and Cell(3,2)/Cell(B3). Once again the typical 
spreadsheet 'Range 1 notation cannot be used to represent this collection of cells. 

[0065] lt js evident from these three simple examples, that the CCE has much more 
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flexibility in that it can selectively group the cells based on the cells 1 content. 

[0066] Data Partitioning and Processing Using the Key-ID, Region-ID and CCE 

[0067] The Key-ID allows the partitioning of the spreadsheet into data sets with the same 
data pattern. This is accomplished by first choosing an appropriate Key-ID, which 
identifies each data set. Next, the user specifies the Region-ID, which specifies the 
region containing the Key-ID in the database. The incorporation of the Key-ID and 
Region-ID in the CCE allows the representation or identification of all cells containing 
the Key-ID. Each of these cells is a CCE element, which will be used to specify the 
position of target cells for spreadsheet operations using relative coordinates. 

[0068] Figure 5.0 shows the use of the CCE to represent or identify all the Key-ID cell 

locations for each data set. In this example, the text Well Name' is contained in each 
data set and is a convenient choice for the Key-ID. The CCE Cell('Well Name'.A), as 
shown in (14), has a Key-ID-Well Name' and a Region-ID defined by RC-ID-Column A 
for the active sheet. The CCE represents the three cells in Column A (Cell(3,1 ), Cell 
(10,1), and Cell(l 6,1)) containing the text 'Well Name 1 (Key-ID). Each cell location, 
containing the Key-ID, is a CCE element to be used as a point of origin in the 
corresponding data set, and will form a base for automated spreadsheet operations. 

[0069] The assignment of a coordinate system with origin (0,0) to a CCE element, as 

applied to this example (shown in Figure 5.0), is illustrated in Figures 5.1 and 5.2. The 
Key-ID cell location contained in each data set is the CCE element for that data set. 
Given that every data set has the same data pattern, the relative locations of the target 
cells (source and/or destination cells) with respect to the data set origin (0,0) will be 
the same for each data set. The advantage of this method is that once the desired 
operations have been specified for a single data set, the same specifications can be 
automatically applied to all other subsequent data sets. 

[0070] 

Note that the coordinate system is assigned to the CCE element in each data set of 
interest. In Figure 5.1 , the data set of interest is Data Set 1 , so (0,0) corresponds to 
the cell location in Data Set 1 containing the Key-ID, namely Cell(3,l) in Figure 5.0. 
Operations performed on Data Set 1 are specified according to this relative coordinate 
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system. In Figure 5.2, Data Set 2 is the data set of interest, so (0,0) in this new 
coordinate system corresponds to Cell(lOJ) in Figure 5.0 and is the cell in Data Set 2 
containing the Key-ID. Operations performed on Data Set 2 will use this new 
coordinate system. The procedure will be the same for the remaining data set. As 
illustrated by these examples, if the relative location of the target cells for each data 
set is the same, then the user only has to define the CCE, relative locations of the 
target cells, and desired operation for one data set. All other subsequent data sets will 
be processed systematically. 

[0071] Data Manipulation Example 

[0072] The use of the CCE to manipulate a database will now be demonstrated by an 

example. Consider Figure 6.0, which shows the spreadsheet from Figure 5.0 with the 
Key-ID, Region-ID, and CCE as defined in the previous examples. The objective will be 
to copy the first production year (source) of each well for each data set and paste it to 
the location two cells to the right of the producer name (destination). 

[0073] First note that the source cell is located in the same column and three rows below 
the CCE element in each data set. Using the relative coordinate notation with respect 
to the CCE element, this is expressed as (3,0). The destination cell, located in the 
same row and three columns to the right of the CCE element, is expressed as (0,3). 

[0074] The data operations to be performed by the computer are as follows: 

[0075] (a) Locate the first data cell location represented by the CCE element in Data Set 1 , 
which contains the Key-ID-Weil Name'. 

[0076] (b) Assign a standard coordinate system so that the origin (0,0) is located at the 
CCE element of the current data set. 

[0077] (c) Copy the contents of the source cell at position (3,0), which is relative to the 
CCE element origin. 

[0078] (d) Paste the contents into the destination cell at position (0,3), which is relative to 
the CCE element origin. 
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[0079] (e) Locate the next CCE element and repeat (b) to (e) until all data sets have been 
processed. 

[0080] Figure 6.1 shows the result of the 'Copy Paste' operations in this example after all 
data sets have been processed. 

[0081] The last example clearly illustrates that once the CCE, spreadsheet operation and 
source and/or destination cells have been specified, all data sets can be automatically 
processed using built-in programs or macros. 

[0082] This automated data manipulation method for the 'Copy Paste' operations, for a 
user specified search region, is illustrated in the flow chart shown in Figure 6.2. 

[0083] Finally, to further elaborate on the innovative nature of this invention the CCE, 

target cells, and type of spreadsheet data operation can form a collection of standard 
input parameters, which are independent of the number and absolute location of the 
data sets. This standard input format allows a series of user-specified data operations 
to be recorded and saved as a 'batch job' for future execution on any data file 
containing data sets of the same format. 
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